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Abstract 



This contribution to the Italian "Workshop sui Monte Carlo, la Fisica e le Simulazioni a 
LHC", held at LNF, Frascati, in February, May and October 2006, summarises the status of 
parton density functions (PDF's) and the impact of their uncertainties on the LHC physics. 
Emphasis is given to methods of contraining PDF's using LHC data. Moreover, the advan- 
tages of the so-called PDF reweighting technique, which enables to quickly estimate PDF 
uncertainties with Monte Carlo events, are also presented. 




Figure 1: 

The Q 2 -x kinematic plane for the LHC and previous experiments, showing the mass (M) and rapidity (y) 
dependence. 

1 Introduction 

The start up of the LHC machine is now imminent and theorists and experimentalists are converging their 
efforts to enhance the LHC discovery potential. This implies minimising theoretical and experimental 
uncertainties. Among the theoretical uncertainties the knowledge of the proton structure plays a major 
role: the accurate evaluation of parton density functions (PDF's) is vital to provide reliable predictions 
of new physics signals (i.e. Higgs, Supersymmetry, Extra Dimensions etc.) and their background cross 
sections at the LHC. As shown in the contribution by C. Mariotti, E. Migliore and P. Nason , at hadron 
colliders the inclusive cross section for hard production processes is the convolution of the cross sec- 
tion at parton level, calculable at fixed order in perturbation theory, and the parton densities of the two 
interacting partons. 

Our knowledge of the proton structure is improving fast thanks to more experimental data being 
available and thanks to more precise and sophisticated theoretical calculations: PDF's are nowadays 
available up to the next-to-next-to leading order (NNLO) in perturbative QCD and in recent years they 
have been also providing uncertainties which take into account experimental systematic errors and the 
correlations between data points that enter the global fits. Despite the great improvement on PDF's in 
recent years, their uncertainty dominates many cross section calculations for the LHC. As visible in 
fig. CD the LHC will probe kinematic regions in x (parton momentum fraction) and Q 2 (hard scattering 
scale) never explored before, such as the very high-Q 2 and the very low-x regions. At low-A" the current 
theoretical formalism (DGLAP) is at the edge of its supposed applicability. For the production of Z and 
W bosons the participating partons have small momentum fractions at central rapidity, x ~ 10 -3 , and 
in the whole measurable rapidity region, \y\ < 2.5, they are within the range 1(T 4 < x < 0.1. Thus, at 
the electro-weak scale the theoretical predictions for the LHC cross sections are dominated by 1ow-a 
PDF uncertainty. At the TeV scale, where we expect new physics, the interacting partons have higher 
momentum fractions and very high Q 2 (> 10 6 GeV 2 ). Thus, at the TeV scale the cross section predictions 
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are dominated by high-* PDF uncertainty and rely on the extrapolation of the DGLAP equations. In both 
kinematic regimes the gluon density, which is in most regions the less well constrained density function, 
plays a major part: at low x the gluon density dominates the quark and anti-quark densities, at high Q 2 
the interacting partons get an important contribution from the sea, which is driven by the gluon density, 
via the g — » qq splitting process. For a review on hard interactions of quarks and gluons at the LHC refer 
to [1]. 

Past and running experiments, such as HERA, have been providing vital information to improve 
our knowledge of the parton densities, however the broad kinematic region of the LHC forces (and 
offers a unique opportunity to) ATLAS and CMS experiments to use their own data to constrain the 
parton densities, in particular the gluon, in the kinematic regions where they are not sufficiently well 
determined. In section 0] it will be shown that significant improvement on PDF fits can be made with 
LHC data. 

2 Global fits and error analysis 

Perturbative QCD provides the evolution equations for the parton densities, DGLAP equations, but does 
not provide us with their analytic forms as function of x. The most common approach to extrapolate 
PDF's as function of x and Q 2 consists in solving the DGLAP equations by parameterising the parton 
densities qi(x) at a fixed scale Qq = 1— 7 GeV 2 , applying assumptions and constraints derived from 
theory and measurements. Then, with the DGLAP equations, we numerically extrapolate the values of 
qi(x, Q 2 ) to different values of Q 2 and a global fit of experimental data is performed. For valence quarks 
the parameterisations have usually this behaviour qy « xr ( 1 — x) 11 , whereas for the gluon and sea quarks 
they are of this kind qs(g) ~ x (1 —x)* 1 . However there is no unanimous agreement on the parametric 
functions to use and on the number of free parameters. For a review refer to [2]. 

Different regions in the x, Q 2 plane and also different partonic components are probed by the available 
world experimental data. These include DIS data from fixed target experiments and HERA, Drell-Yan 
data, inclusive jet production and W charge asymmetry from Tevatron. 

There are various groups who are fitting the proton structure function data, among them CTEQ and 
MRST. Recent PDF sets include in their analyses up-to-date experimental data and attempt to provide 
coherent estimates of the uncertainties, including experimental correlated systematic errors. The differ- 
ences between these PDF sets can be summarised in three categories: different choices of input data sets, 
different theoretical model assumptions and different error analyses. 

There are many sources of uncertainty which contribute to a global fit uncertainty. These can have 
experimental and theoretical origins. The former are related to the data errors which enter the fit, the latter 
are due to the model uncertainties of the theoretical framework. The theoretical uncertainties concern 
both the non-perturbative (parameterisations) and perturbative parts of the calculations: assumptions 
imposed to limit the number of free parameters, higher order truncations in the DGLAP formalism etc. 

The treatment of the experimental uncertainties, especially the correlated systematic uncertainties, is 
a complex subject which is partly still under debate. A modified version of the standard % 2 method is used 
to take into account non-Gaussian systematic errors and their correlations: % 2 — > % 2 + AT 2 , where AT 
is the so-called "tolerance", a complicated mathematical expression that includes correlated systematic 
terms [2]. There are then two methods to compute the central values of the theoretical PDF parameters 
and their uncertainties: the offset and the Hessian method. In the offset method the correlated systematic 
errors affect only the determination of the PDF uncertainty, not the best fit. This method is used for 
ZEUS PDF's. Conversely in the Hessian method, used by CTEQ and MRST groups, the collective effect 
of the correlated systematic errors has also an impact on the best fit. 

For both, the offset and the Hessian methods, the PDF uncertainty is conventionally computed along 
the eigenvectors of the diagonalised covariance or Hessian matrices. The number of eigenvectors corre- 
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Figure 2: 

Left: CTEQ6.5M set at Q 2 ~ M^. Right: comparison between CTEQ6.5M (black) and MRST2004NLO 
(red) gluon PDF's and their uncertainties. 



sponds to the number of free parameters in the parton density parameterisations. Contemporary PDF sets 
provide a central value PDF set, corresponding to the best data fit, and two PDF sets for each uncertainty 
eigenvector, giving the upper and lower limit on the uncertainty. Given a PDF set, the upper limit of the 
PDF uncertainty is calculated for a physical observable by adding in quadrature the upward displacement 
eigenvectors, whereas the lower limit by adding in quadrature the downward displacement eigenvectors. 
MRST group has chosen 15 free parameters, leading to 30 error sets; CTEQ6 has 20 free parameters 
and 40 error sets. Fig. [2] shows CTEQ6.5 fit for all parton densities at the scale Q 2 ~ and its gluon 
uncertainty compared to the MRST2004NLO gluon best fit. 

3 Impact of PDF uncertainty on LHC physics 

The experience from previous experiments teaches that the PDF uncertainties must be properly taken into 
account or features of the SM physics can be misinterpreted as evidence of new physics. For example 
an unexplained discrepancy between data and theory was originally found in the Tevatron Run-I jet data, 
which was subsequently reabsorbed within the theoretical uncertainty when a more accurate PDF error 
analysis was performed. 

G. Polesello's contribution on inclusive jet cross-section has shown that the PDF uncertainty is dom- 
inating for high Ej jets over the renormalisation/factorisation scale and the experimental energy scale 
uncertainties: 10% at 1 TeV, 25% at 2 TeV, 60% at 5 TeV. 

3.0.1 Extra dimensions 

In extra dimensions models, if the compactification scale Mq is about few TeV0, it is possible to observe 
the production of gravitons and Kaluza Klein (KK) excitations at the LHC. If gauge bosons can propagate 

''in this context the compactification scale is defined as Mq = l/Rc where Rq is the compactification radius of the extra 
dimensions on a hypersphere. 
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Figure 3: 

CTEQ6.1 uncertainty on distributions of the high-mass di-electron Mu (left and centre) and rapidity 
y (right). Herwig+ Jimmy generation and ATLAS full simulation [5], N.B.: the drop in the low Mu 
spectrum is an artifact of the event selection in the Monte Carlo. 

in the extra dimensions, we also expect a violation of the SM logarithmic behaviour of the running 
couplings. In this scenario, if we consider the CTEQ6M PDF uncertainty on the di-jet cross-section, 
we see the extra dimensions prediction being absorbed within the SM prediction zone: the high-x gluon 
uncertainty can cause a decrease of the discovery reach from Mc = 5 (10) TeV to Mq < 2 (3) TeV, 
depending on the number of extra dimensions [3]. 

3.0.2 Higgs 

The accurate measurements of the Higgs production cross sections and decay branching ratios are crucial 
to explore all Higgs boson fundamental properties. At the same time, we need very precise estimates of 
the various theoretical uncertainties. 

It is found that the PDF uncertainty can be of the same order of magnitude or even higher than 
the other theoretical uncertainties. In fact the perturbative calculations of Higgs production cross sec- 
tion are becoming more stable as higher orders are included, leaving the PDF uncertainty as one of the 
largest contributions to the total theoretical uncertainty. For example for the dominant Higgs production 
channel, gg — > H, the PDF uncertainty on gg luminosity, can be larger than the factorisation and renor- 
malisation scale uncertainty: in fact the differences in the gg luminosity prediction between MRST2002 
and Alekhin2002 can be higher than 10% for low Higgs mass scenarios. Furthermore, studying the 
effect of three different PDF sets (i.e. CTEQ6M, MRST2001E and Alekhin2002) with their quoted un- 
certainties, on various Higgs productions channels, we see that the PDF uncertainty can be of the order 
of ~ 10 — 15% on the production cross-section [4]. 

3.0.3 High mass Drell-Yan 

Several new physics models predict events with two charged leptons originating from the decay of a 
massive object. A peak in the da/dM distribution is a clean signature of a new resonance: the identifica- 
tion and reconstruction of high-mass di-lepton final states can be done with high efficiency and the SM 
background can be small. However the shape and normalisation of the predicted observable distributions 
depend on PDF and its uncertainty. 
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In fig. [3] we see the total CTEQ6. 1 uncertainty on the distributions of the reconstructed rapidity y 
and invariant mass Mu of the lepton pair: 40 CTEQ error sets have been accounted for, applying the 
PDF reweighting technique (see sec. 13). The uncertainty is in the range 4 — 7% on both y and Mu up 
to 1 TeV. Excluding the bins at the edge of the rapidity distributions, where statistical fluctuations are 
present, we see that the largest PDF uncertainty is at y ~ 0. As explained in [5], a study shows that NLO 
QCD corrections, applied on Monte Carlo (MC) and on PDF, enhance the cross section with respect to 
the LO prediction by 24 — 36%, with the largest NLO corrections at y ~ 0. A discrepancy of about 6% 
is found between MRST-NLO and CTEQ-NLO PDF's. 

4 How to constrain PDF at LHC 

Several Standard Model processes are under study to constrain partem densities: the productions of y, W 
and Z bosons and inclusive jets are equally important to constrain the parton densities and in particular 
the gluon density in complementary kinematic regions (see [6]). 

In G. Polesello's contribution we appreciate how the LHC jet data can be used to better constrain PDF 
fits: if the experimental systematic uncertainty is under control to < 10% level, LHC jet data can sig- 
nificantly contribute to constraining the high-* gluon density with 1 fb~ l luminosity. Other studies [7] 
have also shown that the prompt photon production process is extremely sensitive to PDF differences 
and can probe the perturbative theory of the gluon at high-x: the discrepancy between MRST2004-NLO, 
CTEQ6.1M and older PDF sets can be of the order of 16 — 18% on the photon 77 and pj distributions. 
Furthermore, the bg — > Zb process is sensitive to the &-quark content of the proton and the LHC predic- 
tions for the Z + b cross-section, using different PDF sets, are ±5 — 10% [8]. 

4.1 W rapidity distributions 

A few days of LHC running at the nominal low luminosity (10 33 cm~ 2 s ) are sufficient to make the 
statistical uncertainty negligible with respect to the systematic uncertainties on W cross section. Among 
the systematic uncertainties there are experimental and theoretical contributions. 

The ATLAS strategy for selecting W bosons consist of identifying an isolated and highly energetic 
lepton, Ej > 25 GeV, and requiring a large amount of missing energy in the event due to the neutrino es- 
caping detection, Ej > 25 GeV. The analysis of W — * ev e events fully simulated in the ATLAS detector, 
in the early data scenario, shows that the W boson is a very clean signature: the trigger and the electron 
off-line identification with the electron Ej and Ej cuts leave a background contamination dominated by 
QCD events (less than 5%) and W — > tv t (about 0.5%). If a jet veto cut is added, the QCD background 
can be further reduced to a level of < 1% [9]. Therefore the W sector is an ideal environment to study 
and constrain theoretical and experimental systematics. 

4.1.1 Higher order corrections 

The differential cross section da/dy for W production has been calculated to the NNLO order in QCD 
with an energy scale uncertainty of < 1% [10]. With this level of precision in perturbative QCD cal- 
culations, the electro-weak (EW) contributions are no more negligible. As presented in this workshop, 
leading order electro-weak contributions with multi-photon radiation introduce corrections of the order 
of few percent on W boson cross-sections. The EW corrections, computed by the program HORACE 
interfaced to HERWIG in the a(0) scheme in the muon channel [11], are constant in rapidity and are 
about 3.5% for a cut on the muon transverse momentum of pj > 25 GeV and can be up to 5.2% for loser 
Pt cuts. The dependence on the muon charge is negligible (up to 0.4% for lose pj cuts) [12]. Consid- 
ering that these corrections in the muon channel are flat in rapidity and negligible on the muon-charge 
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Figure 4: 

HERWIG simulations of e ± from W ± decay, with CTEQ6.1M (red), MRST2001 (black) and ZEUS- 
S (green) PDF's and their quoted uncertainties (estimated with the PDF reweighting technique as in 
sec. 12]). The top plots are at generator level, the bottom plots at ATLFAST detector level. Left fig: e~ 
(left plots) and e + (right plots) rapidity spectra with NLO-QCD corrections. Right fig: electron-charge 
asymmetry [13 ]. 

asymmetry, we can state that they do not have an impact on the PDF extraction, however they are relevant 
for luminosity measurements in order to achieve a precision of 6% or better. The electron channel needs 
further investigation. 

4.1.2 PDF uncertainty on W ± rapidity distribution. 

From fig. 0] we can see the full PDF uncertainties for three different PDF analyses, on the rapidity dis- 
tribution of e^ originating from W ± decays. Their predictions are compatible within their uncertainties, 
which are in the range 4% — 12%, and are dominated by the gluon density. 

In a previous paper [14] it is demonstrated that the LHC can improve the current constraint on the 
low-* gluon parameter X g (xg(x) « x~ by more than 41 % by fitting the e + and e~ rapidity distributions, 

if their experimental systematic uncertainties are kept under 5% level. 

, i+ , i~ , [+ , i~ 
In the lepton-charge asymmetry A/ = (|^. — 12 )/(^ ) most of the gluon uncertainty 

cancel out leaving the valence up (uy) and down (dy) densities as main contributions to the total PDF 

uncertainty, which is reduced to ~ 5% at r\ w 0. However a discrepancy of ~ 15% is present at T] ~ 

between the MRST2002 and other two PDF's, CTEQ6.1M and ZEUS-S [15]. In fact the MRST PDF's 

prediction for uy — dy valence density is different from the other PDF's and is outside the quoted PDF 

uncertainty bands. This difference in current PDF fits comes from the lack of data on valence quantities 

at such low-*. The LHC can be the first experiment to perform such measurement in the kinematic region 

x«l(T 3 and£ 2 =M^. 

4.1.3 A posteriori inclusion of PDF's in NLO calculations. 

The MC computation of QCD final state observables to NLO is a lengthy process. In order to study 
the impact of PDF uncertainties on QCD cross section measurements in a faster way and allow for PDF 
fitting of these quantities, the technique of "a posteriori" inclusion of PDF's in NLO calculations has 
been developed for LHC processes [16] [17]. A MC run is used to generate a grid (in x\, X2 and Q) of 



cross section weights that can subsequently be combined with an arbitrary PDF set. This enables the 
decoupling of the lengthy calculation of perturbative MC weights from the convolution with the partem 
densities. Perturbative coefficients for jet (using NLOJET++), W and Z boson (using MCFM) production 
processes can be collected on a grid with an accuracy better than 0.02%. 

5 PDF reweighting of Monte Carlo events 

The computation of the full PDF uncertainty on a physics process is a cumbersome procedure. Given 
one PDF set, such as CTEQ or MRST, it requires the generation of twice as many MC samples as the 
number of free parameters in the global fit. Furthermore one error analysis might not be sufficient since, 
as seen above, there can be large discrepancies between the results of different error analyses. 

A PDF reweighting technique has been studied and tested, requiring only one Monte Carlo generation 
with one conventional PDF set §[13] [14]. 

This technique has been implemented using hard process parameters of the MC generation: flavours 
(flav\ and flav-i) and momentum fractions of the interacting partons Xfl avi , Xfi aV2 an d the energy scale 
Q. The PDF set used for the MC generation is named PDF\ . 

The PDF reweighting technique consists of evaluating, on the event-by-event basis, the probability 
of picking up the same flavoured partons with the same momentum fractions Xfi clVl , Xfi aV2 , according to 
a second PDF set, PDF2, at the same energy scale Q, then evaluating the following ratio 

Z7 + W7 ' U* fPDF 2 (Xflavi,Q) fpDF 2 (xflav 2 ,Q) ns 

Event Weight = — ■ — . (1) 

JPDF t (Xflav i,Q) JPDFi {X flav 2 , Q ) 

After the Event Weight is applied on MC events generated with PDF\ , they will effectively be dis- 
tributed according to PDF2. 

This technique has been tested using HERWIG (for inclusive W production) and ALPGEN interfaced 
to HERWIG (for W+jets production) as Monte Carlo generators and with various recent PDF sets. Sim- 
ilar results have been obtained with these two MC generators and with different PDF sets, as discussed 
below. Fig. \5\ shows the accuracy of this technique using HERWIG: the bias over the all y range is of 
the order of 0.5% or less and there is no evidence of y dependence. Comparing the bottom plots on the 
right and left hand sides of fig. \5\ we see that the PDF reweighting technique corrects for the difference 
in normalisation between PDF\ and PDF2 and corrects for the y modulation. 

This technique can be used to estimate the full PDF uncertainty, starting from one sample of MC 
generated events, for distributions that are determined by the MC hard process. 
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